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^ a decision problem requires the processors to independently inspect their local neigh- 

borhoods and then collectively decide whether a given global input instance belongs to 
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A central theme in distributed network algorithms concerns understanding and 
coping with the issue of locality. Despite considerable progress, research efforts in this 
direction have not yet resulted in a solid basis in the form of a fundamental compu- 
tational complexity theory. Inspired by sequential complexity theory, we focus on a 
complexity theory for distributed decision problems. In the context of locality, solving 



We consider the standard COCAC model of computation and define LD(t) (for 
local decision) as the class of decision problems that can be solved in t number of com- 
munication rounds. We first study the intriguing question of whether randomization 
helps in local distributed computing, and to what extent. Specifically, we define the 
corresponding randomized class BPLD(i,p, g), containing languages for which there 
exists a randomized algorithm that runs in t rounds and accepts correct instances with 
^ ■ probability at least p and rejects incorrect ones with probability at least q. We show 

$H ' that there exists a language that does not belong to LD(t) for any t = o[n) but which 

belong for BPLD(0,p, q) for any p,q ^ (0, 1] such that p^ -\- q < 1. On the other hand, 
we show that, restricted to hereditary languages, BPLD(t,p, g') = LD(0(t)), for any 
function t and any p,q £ (0, 1] such that p^ + q > 1. 

In addition, we investigate the impact of non-determinism on local decision, and 
establish some structural results inspired by classical computational complexity theory. 
Specifically, we show that non-determinism does help, but that this help is limited, as 
there exist languages that cannot be decided non-deterministically. Perhaps surpris- 
ingly, it turns out that it is the combination of randomization with non-determinism 
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that enables to decide all languages in constant time. Finally, we introduce the notion 
of local reduction, and establish some completeness results. 

Keywords: Local distributed algorithms, local decision, nondeterminism, randomized algo- 
rithms. 



1 Introduction 



1.1 Motivation 

Distributed computing concerns a collection of processors which collaborate in order to 
achieve some global task. With time, two main disciplines have evolved in the field. One 
discipline deals with timing issues, namely, uncertainties due to asynchrony (the fact that 
processors run at their own speed, and possibly crash), and the other concerns topology 
issues, namely, uncertainties due to locality constraints (the lack of knowledge about far away 
processors). Studies carried out by the distributed computing community within these two 
disciplines were to a large extent problem- driven. Indeed, several major problems considered 
in the literature concern coping with one of the two uncertainties. For instance, in the 
asynchrony- discipline, Fischer, Lynch and Paterson [H] proved that consensus cannot be 
achieved in the asynchronous model, even in the presence of a single fault, and in the 
locality- discipline, Linial |28] proved that (A + l)-coloring cannot be achieved locally (i.e., 
in a constant number of communication rounds), even in the ring network. 

One of the significant achievements of the asynchrony-discipline was its success in es- 
tablishing unifying theories in the fiavor of computational complexity theory. Some central 
examples of such theories are failure detectors [6l [7] and the wait-free hierarchy (including 
Herlihy's hierarchy) [18]. In contrast, despite considerable progress, the locality-discipline 
still suffers from the absence of a solid basis in the form of a fundamental computational 
complexity theory. Obviously, defining some common cost measures (e.g., time, message, 
memory, etc.) enables us to compare problems in terms of their relative cost. Still, from a 
computational complexity point of view, it is not clear how to relate the difficulty of prob- 
lems in the locality-discipline. Specifically, if two problems have different kinds of outputs, 
it is not clear how to reduce one to the other, even if they cost the same. 

Inspired by sequential complexity theory, we focus on decision problems, in which one is 
aiming at deciding whether a given global input instance belongs to some specified language. 
In the context of distributed computing, each processor must produce a boolean output, 
and the decision is defined by the conjunction of the processors' outputs, i.e., if the instance 
belongs to the language, then all processors must output "yes" , and otherwise, at least one 
processor must output "no". Observe that decision problems provide a natural framework 
for tackling fault-tolerance: the processors have to collectively check whether the network 
is fault-free, and a node detecting a fault raises an alarm. In fact, many natural problems 
can be phrased as decision problems, like "is there a unique leader in the network?" or "is 
the network planar?". Moreover, decision problems occur naturally when one is aiming at 
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checking the vahdity of the output of a computational task, such as "is the produced coloring 
legal?", or "is the constructed subgraph an MST?". Construction tasks such as exact or 
approximated solutions to problems like coloring, MST, spanner, MIS, maximum matching, 
etc., received enormous attention in the literature (see, e.g., O |25l |26l |28l |30l El |32l 138]). 
yet the corresponding decision problems have hardly been considered. 

The purpose of this paper is to investigate the nature of local decision problems. Decision 
problems seem to provide a promising approach to building up a distributed computational 
theory for the locality-discipline. Indeed, as we will show, one can define local reductions 
in the framework of decision problems, thus enabling the introduction of complexity classes 
and notions of completeness. 

We consider the COCAC model |3S], which is a standard distributed computing model 
capturing the essence of locality. In this model, processors are woken up simultaneously, 
and computation proceeds in fault-free synchronous rounds during which every processor 
exchanges messages of unlimited size with its neighbors, and performs arbitrary computations 
on its data. Informally, let us define LD(if:) (for local decision) as the class of decision problems 
that can be solved in t number of communication rounds in the COCAC model. (We find 
special interest in the case where t represents a constant, but in general we view t as a 
function of the input graph. We note that in the COCAC model, every decidable decision 
problem can be solved in n communication rounds, where n denotes the number of nodes in 
the input graph.) 

Some decision problems are trivially in LD(0(1)) (e.g., "is the given coloring a (A + 1)- 
coloring?", "do the selected nodes form an MIS?", etc.), while some others can easily be 
shown to be outside LD(t), for any t = o{n) (e.g., "is the network planar?" , "is there a unique 
leader?", etc.). In contrast to the above examples, there are some languages for which it is 
not clear whether they belong to LD(t), even for t = 0(1). To elaborate on this, consider the 
particular case where it is required to decide whether the network belongs to some specified 
family J-" of graphs. If this question can be decided in a constant number of communication 
rounds, then this means, informally, that the family J-" can somehow be characterized by 
relatively simple conditions. For example, a family J-" of graphs that can be characterized as 
consisting of all graphs having no subgraph from C, for some specified finite set C of finite 
subgraphs, is obviously in LD(0(1)). However, the question of whether a family of graphs 
can be characterized as above is often non-trivial. For example, characterizing cographs as 
precisely the graphs with no induced P4, attributed to Seinsche |10], is not easy, and requires 
nontrivial usage of modular decomposition. 

The first question we address is whether and to what extent randomization helps. For 
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p,q E (0, 1], define BPLD(t,p, q) as the class of all distributed languages that can be decided 
by a randomized distributed algorithm that runs in t number of communication rounds and 
produces correct answers on legal (respectively, illegal) instances with probability at least 
p (resp., q). An interesting observation is that for p and q such that + g < 1, we have 
LD(t) C BPLD(t,p, q). In fact, for such p and q, there exists a language C G BPLD(0,p, q), 
such that C ^ LD(t), for any t = o{n). To see why, consider the following Unique-Leader 
language. The input is a graph where each node has a bit indicating whether it is a leader 
or not. An input is in the language Unique-Leader if and only if there is at most one leader 
in the graph. Obviously, this language is not in LD(t), for any t < n. We claim it is in 
BPLD(0,p, g), for p and q such that p^ + q <\. Indeed, for such p and g, we can design the 
following simple randomized algorithm that runs in time: every node which is not a leader 
says "yes" with probability 1, and every node which is a leader says "yes" with probability 
p. Clearly, if the graph has at most one leader then all nodes say "yes" with probability at 
least p. On the other hand, if there are at least k > 2 leaders, at least one node says "no" , 
with probability at least 1 — p^ > 1 — p"^ > q. 

It turns out that the aforementioned choice of p and q is not coincidental, and that p'^+q = 
1 is really the correct threshold. Indeed, we show that Unique-Leader ^ BPLD(t,p, g), for 
any t < n, and any p and g such that + g > 1. In fact, we show a much more general result, 
that is, we prove that if + g > 1, then restricted to hereditary languages, BPLD(t,p, g) 
actually collapses into LD(0(t)), for any t. 

In the second part of the paper, we investigate the impact of non-determinism on local 
decision, and establish some structural results inspired by classical computational complexity 
theory. Specifically, we show that non-determinism does help, but that this help is limited, 
as there exist languages that cannot be decided non-deterministically. Perhaps surprisingly, 
it turns out that it is the combination of randomization with non-determinism that enables 
to decide all languages in constant time. Finally, we introduce the notion of local reduction, 
and establish some completeness results. 

1.2 Our contributions 
1.2.1 Impact of randomization 

We study the impact of randomization on local decision. We prove that if + g > 1, 
then restricted to hereditary languages, BPLD(t,p, g) = LD(0(t)), for any function t. This, 
together with the observation that LD(t) C BPLD(t,p, g), for any t = o(n), may indicate 
that + g = 1 serves as a sharp threshold for distinguishing the deterministic case from the 
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randomized one. 

1.2.2 Impact of non-determinism 

We first show that non-determinism helps local decision, i.e., we show that the class NLD(t) 
(cf. Section [273|l strictly contains LD(t). More precisely, we show that there exists a language 
in NLD(0(1)) which is not in LD(t) for every t = o(n), where n is the size of the input graph. 
Nevertheless, NLD(t) does not capture all (decidable) languages, for t = o{n). Indeed we 
show that there exists a language not in NLD(t) for every t = o{n). Specifically, this language 
is #n = {{G,n) I \V{G)\ =n}. 

Perhaps surprisingly, it turns out that it is the combination of randomization with non- 
determinism that enables to decide all languages in constant time. Let BPNLD(0(1)) = 
BPNLD(0(l),p, g), for some constants p and q such that + g < 1. We prove that 
BPNLD(0(1)) contains all languages. To sum up, LB{o{n)) C NLD(0(1)) C NLD(o(n)) C 
BPNLD(0(1)) = All. 

Finally, we introduce the notion of many-one local reduction, and establish some com- 
pleteness results. We show that there exits a problem, called cover, which is, in a sense, 
the most difficult decision problem. That is we show that cover is BPNLD(0(l))-complete. 
(Interestingly, a small relaxation of cover, called containment, turns out to be NLD(0(1))- 
complete). 

1.3 Related work 

Locality issues have been thoroughly studied in the literature, via the analysis of various 
construction problems, including (A + l)-coloring and Maximal Independent Set (MIS) [H 
[51 EHl Eg EHl EQl [35], Minimum Spanning Tree (MST) [l2l [25l [37] , Maximal Matching [19], 
Maximum Weighted Matching [3Tl[32l[lT], Minimum Dominating Set [Ml [27], Spanners [Qj 
[131 [38], 6tc. For some problems (e.g., coloring [3 [231 [35]), there are still large gaps between 
the best known results on specific famihes of graphs (e.g., bounded degree graphs) and on 
arbitrary graphs. 

The question of what can be computed in a constant number of communication rounds 
was investigated in the seminal work of Naor and Stockmeyer [3l]. In particular, that pa- 
per considers a subclass of LD(0(1)), called LCL, which is essentially LD(0(1)) restricted 
to languages involving graphs of constant maximum degree, and involving processor inputs 
taken from a set of constant size, and studies the question of how to compute in 0(1) rounds 
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the constructive versions of decision problems in LCL. The paper provides some beautiful 
general results. In particular, the authors show that if there exists a randomized algorithm 
that constructs a solution for a problem in LCL in 0(1) rounds, then there is also a deter- 
ministic algorithm constructing a solution for this problem in 0(1) rounds. Unfortunately, 
the proof of this result relies heavily on the definition of LCL. Indeed, the constant bound 
constraints on the degrees and input sizes allow the authors to cleverly use Ramsey theory. 
It is thus not clear whether it is possible to extend this result to all languages in LD(0(1)). 

The question of whether randomization helps in decreasing the locality parameter of 
construction problems has been the focus of numerous studies. To date, there exists evidence 
that, for some problems at least, randomization does not help. For instance, [33] proves this 
for 3-coloring the ring. In fact, for low degree graphs, the gaps between the efficiencies of the 
best known randomized and deterministic algorithms for problems like MIS, (A + l)-coloring, 
and Maximal Matching are very small. On the other hand, for graphs of arbitrarily large 
degrees, there seem to be indications that randomization does help, at least in some cases. For 
instance, (A + l)-coloring can be randomly computed in expected 0(logr2) communication 
rounds on n-node graphs [H, |30] , whereas the best known deterministic algorithm for this 
problem performs in 2*^*^^'°^"^ rounds [35]. (A + l)-coloring results whose performances 
are measured also with respect to the maximum degree A illustrate this phenomena as 
well. Specifically, [39] shows that (A + l)-coloring can be randomly computed in expected 
0(log A + v^ogn) communication rounds whereas the best known deterministic algorithm 
performs in 0(A + log* n) rounds [H |23] . 

Recently, several results were established conserving decision problems in distributed 
computing. For example, [5] and [2D] study specific decision problems in the CONQEST 
model. (In contrast to the COCAC model, this model assumes that the message size is 
bounded by O(logn) bits, hence dealing with congestion is the main issue.) Specifically, 
tight bounds are established in [20] for the time and message complexities of the problem of 
deciding whether a subgraph is an MST, and time lower bounds for many other subgraph- 
decision problems (e.g., spanning tree, connectivity) are established in [S]. It is interesting 
to note that some of these lower bounds imply strong unconditional time lower bounds on 
the hardness of distributed approximation for many classical construction problems in the 
COMQSST model. Decision problems have received recent attention from the asynchrony- 
discipline too, in the framework of wait-free computing [T7]. In this framework, the focus is 
on task checkability. Wait-free checkable tasks have been characterized in term of covering 
spaces, a fundamental tool in algebraic topology. 

The theory of proof-labeling schemes [211 122] was designed to tackle the issue of locally 
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verifying (with the aid of a proof, i.e., a certificate, at each node) solutions to problems 
that cannot be decided locally (e.g., "is the given subgraph a spanning tree of the network?", 
or, "is it an MST?"). In fact, the model of proof-labeling schemes has some resemblance 
to our definition of the class NLD(0(1)). Investigations in the framework of proof-labeling 
schemes mostly focus on the minimum size of the certificate necessary so that verification can 
be performed in a single round. The notion of proof-labeling schemes also has interesting 
similarities with the notions of local detection [2], local checking [3], or silent stabilization 
which were introduced in the context of self- stabilization [TO] . 

The use of oracles that provide information to nodes was studied intensively in the context 
of distributed construction tasks. For instance, this framework, called local computation with 
advice, was studied in [TJ] for MST construction and in [TB] for 3-coloring a cycle. 

Finally, we note that our notion of NLD seems to be related to the theory of lifts, e.g.. 



2 Decision problems and complexity classes 
2.1 Model of computation 

Let us first recall some basic notions in distributed computing. We consider the COCAC 
model [36], which is a standard model capturing the essence of locality. In this model, pro- 
cessors are assumed to be nodes of a network G, provided with arbitrary distinct identities, 
and computation proceeds in fault-free synchronous rounds. At each round, every processor 
V G V{G) exchanges messages of unrestricted size with its neighbors in G, and performs 
computations on its data. We assume that the number of steps (sequential time) used for 
the local computation made by the node v in some round r is bounded by some function 
fA{H{r,v)), where H{r,v) denotes the size of the "history" seen by node v up to the begin- 
ning of round r. That is, the total number of bits encoded in the input and the identity of 
the node, as well as in the incoming messages from previous rounds. Here, we do not impose 
any restriction on the growth rate of /a. We would like to point out, however, that imposing 
such restrictions, or alternatively, imposing restrictions on the memory used by a node for 
local computation, may lead to interesting connections between the theory of locality and 
classical computational complexity theory. To sum up, during the execution of a distributed 
algorithm A, all processors are woken up simultaneously, and, initially, a processor is solely 
aware of it own identity, and possibly to some local input too. Then, in each round r, every 
processor v 
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(1) sends messages to its neighbors, 

(2) receives messages from its neighbors, and 

(3) performs at most /^(if(r, f)) computations. 

After a number of rounds (that may depend on the network G and may vary among the 
processors, simply because nodes have different identities, potentially different inputs, and 
are typically located at non- isomorphic positions in the network), every processor v termi- 
nates and outputs some value out(t;). Consider an algorithm running in a network G with 
input X and identity assignment Id. The running time of a node w, denoted T„(G',x, Id), is 
the maximum of the number of rounds until v outputs. The running time of the algorithm, 
denoted T(G', x. Id), is the maximum of the number of rounds until all processors terminate, 
i.e., T(G, X, Id) = max{T^(G, x. Id) | v G V{G)}. Let t be a non- decreasing function of input 
configurations (G,x, Id). (By non-decreasing, we mean that if G' is an induced subgraph 
of G and x' and Id' are the restrictions of x and Id, respectively, to the nodes in G', then 
i(G',x',Id') < t(Cj,x, Id).) We say that an algorithm A has running time at most t, if 
T(G, X, Id) < t{G, X, Id), for every (G, x. Id). We shall give special attention to the case that 
t represents a constant function. Note that in general, given (G, x. Id), the nodes may not be 
aware of t(G, x. Id). On the other hand, note that, iit = t(G,x, Id) is known, then w.l.o.g. 
one can always assume that a local algorithm running in time at most t operates at each 
node V in two stages: (A) collect all information available in BG{v,t), the t-neighborhood, or 
ball of radius t of w in G, including inputs, identities and adjacencies, and (B) compute the 
output based on this information. 

2.2 Local decision (LD) 

We now refine some of the above concepts, in order to formally define our objects of in- 
terest. Obviously, a distributed algorithm that runs on a graph G operates separately on 
each connected component of G, and nodes of a component G' of G cannot distinguish the 
underlying graph G from G'. For this reason, we consider connected graphs only. 

Definition 2.1 A configuration is a pair (G,x) where G is a connected graph, and every 
node V G V{G) is assigned as its local input a binary string x(t;) G {0, 1}*. 

In some problems, the local input of every node is empty, i.e., x(t') = e for every v G V{G), 
where e denotes the empty binary string. Since an undecidable collection of configurations 
remains undecidable in the distributed setting too, we consider only decidable collections of 
configurations. Formally, we define the following. 
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Definition 2.2 A distributed language is a decidable collection C of configurations. 

In general, there are several possible ways of representing a configuration of a distributed 
language corresponding to standard distributed computing problems. Some examples con- 
sidered in this paper are the following. 

Unique-Leader = {(G, x) ||| x ||i< 1} consists of all configurations such that there exists 
at most one node with local input 1, with all the others having local input 0. 

Consensus = {{G, (xi,X2)) | 3m G V{G),Wv G y(G),X2(w) = xi(m)} consists of all config- 
urations such that all nodes agree on the value proposed by some node. 

Coloring = {(Gjx) | Wv G V{G),Ww G N{v),:s.{v) ^ x(u;)} where N{v) denotes the (open) 
neighborhood of that is, all nodes at distance 1 from v. 

MIS = {(G,x) \ S = {v^ V{G) I x(t;) = 1} forms a MIS}. 

SpanningTree = {(G, (name, head)) | T = {e^ = {y,v^^,v G l^(G), head(f ) = name(f"'")} 
is a spanning tree of G} consists of all configurations such that the set T of edges 
e„ between every node v and its neighbor v'^ satisfying name(w"'") = head(t') forms a 
spanning tree of G. 

(The language MST, for minimum spanning tree, can be defined similarly). 

An identity assignment Id for a graph G is an assignment of distinct integers to the 
nodes of G. A node v G V(G) executing a distributed algorithm in a configuration (G, x) 
initially knows only its own identity Id(t') and its own input x(t;), and is unaware of the 
graph G. After t rounds, v acquires knowledge only of its t-neighborhood BG{v,t). In each 
round r of the algorithm A, a node may communicate with its neighbors by sending and 
receiving messages, and may perform at most /^(if(r, f)) computations. Eventually, each 
node V G V{G) must output a local output out(t;) G {0, 1}*. 

Let £ be a distributed language. We say that a distributed algorithm A decides C if and 
only if for every configuration (G,x), and for every identity assignment Id for the nodes of 
G, every node of G eventually terminates and outputs "yes" or "no" , satisfying the following 
decision rules: 

• If (G, x) G C, then out(t') ="yes" for every node v G V{G); 

• If (G, x) ^ £, then there exists at least one node v G V{G) such that out(t;) ="no". 

We are now ready to define one of our main subjects of interest, the class LD(t), for local 
decision. 
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Definition 2.3 Lett be a non- decreasing function of triplets (G,x, Id). Define LD(t) as the 
class of all distributed languages that can be decided by a local distributed algorithm that runs 
in number of rounds at most t. 

For instance, Coloring G LD(1) and MIS G LD(1). On the other hand, it is not hard to 
see that languages such as Unique-Leader, Consensus, and SpanningTree are not in LD(t), 
for any t = o{n). In what follows, we define LD(0(t)) = Uc>iLD(c ■ t). 

2.3 Non-deterministic local decision (NLD) 

A distributed verification algorithm is a distributed algorithm A that gets as input, in ad- 
dition to a configuration (G, x), a global certificate vector y, i.e., every node v of a graph G 
gets as input a binary string x(i;) G {0, 1}*, and a certificate y{v) G {0, 1}*. A verification 
algorithm A verifies C if and only if for every configuration [G, x) , the following hold: 

• If {G, x) G C, then there exists a certificate y such that for every id-assignment 
Id, algorithm A applied on (G, x) with certificate y and id-assignment Id outputs 
out(tO ="yes" for all v G 

• If {G, x) ^ C, then for every certificate y and for every id-assignment Id, algorithm A 
applied on (G, x) with certificate y and id- assignment Id outputs out(f) ="no" for at 
least one node v G V{G). 

One motivation for studying the nondeterministic verification framework comes from settings 
in which one must perform local verifications repeatedly. In such cases, one can afford to 
have a relatively "wasteful" preliminary step in which a certificate is computed for each node. 
Using these certificates, local verifications can then be performed very fast. See [2T| [22] for 
more details regarding such applications. Indeed, the definition of a verification algorithm 
finds similarities with the notion of proof-labeling schemes discussed in |2T1 122]. Informally, 
in a proof-labeling scheme, the construction of a "good" certificate y for a configuration 
{G, x) G £ may depend also on the given id- assignment. Since the question of whether a con- 
figuration (G, x) belongs to a language C is independent from the particular id- assignment, 
we prefer to let the "good" certificate y depend only on the configuration. In other words, 
as defined above, a verification algorithm operating on a configuration (G, x) G £ and a 
"good" certificate y must say "yes" at every node regardless of the id- assignment. 

We now define the class NLD(t), for nondeterministic local decision, (our terminology is 
by direct analogy to the class NP in sequential computational complexity). 
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Definition 2.4 Let t be a non- decreasing function of triplets (G, x, Id). Define NLD(t) as 
the class of all distributed languages that can be verified in at most t communication rounds. 

2.4 Bounded-error probabilistic local decision (BPLD) 

A randomized distributed algorithm is a distributed algorithm A that enables every node v, 
at any round r during the execution, to toss a number of random bits obtaining a string 
r{v) G {0, 1}*. Clearly, this number cannot exceed fA{H{r,v)), the bound on the number of 
computational steps used by node v at round r. Note however, that H{r, v) may now also 
depend on the random bits produced by other nodes in previous rounds. For g G (0, 1], we 
say that a randomized distributed algorithm ^ is a {p,q) -decider for C, or, that it decides 
C with "yes" success probability p and "no" success probability q, if and only if for every 
configuration [G, x), and for every identity assignment Id for the nodes of G, every node of G 
eventually terminates and outputs "yes" or "no" , and the following properties are satisfied: 

• If (G',x) G C, then Pr[out(t;) = "yes" for every node v G V^(G')] > p, 

• If (G',x) ^ C, then Pr[out(t;) = "no" for at least one node v G V'(G)] > q, 

where the probabilities in the above definition are taken over all possible coin tosses per- 
formed by nodes. We define the class BPLD(t,p, g), for "Bounded-error Probabilistic Local 
Decision" , as follows. 

Definition 2.5 Forp,q G (0, 1] and a function t, BPLD(t,p, g) is the class of all distributed 
languages that have a local randomized distributed {p, q) -decider running in time t. (i.e., can 
be decided in time t by a local randomized distributed algorithm with "yes" success probability 
p and "no" success probability q). 

3 A sharp threshold for randomization 

Consider some graph G, and a subset U of the nodes of G, i.e., U C V{G). Let G[U] denote 
the vertex-induced subgraph of G defined by the nodes in U. Given a configuration (G,x), 
let x[?7] denote the input x restricted to the nodes in U . For simplicity of presentation, if 
if is a subgraph of G, we denote x[V^(i7)] by x[i7]. A prefix of a configuration (G, x) is a 
configuration (G[t/], x[f/]), where U C V{G) (note that in particular, G[U] is connected). 
We say that a language C is hereditary if every prefix of every configuration (G, x) G £ is also 
in L. Coloring and Unique-Leader are clearly hereditary languages. As another example 



10 



of an hereditary language, consider a family Q of hereditary graphs, i.e., that is closed under 
vertex deletion; then the language {(G,e) | G G ^} is hereditary. Examples of hereditary 
graph families are planar graphs, interval graphs, forests, chordal graphs, cographs, perfect 
graphs, etc. 

Theorem 13.11 below asserts that, for hereditary languages, randomization does not help 
if one imposes that p"^ + q > 1, i.e, the "no" success probability distribution is at least as 
large as one minus the square of the "yes" success probability. Somewhat more formally, 
we prove that for hereditary languages, we have [Jp2+g>]^ BPLD(t,p, g) = LD(0(t)). This 
complements the fact that for + g < 1, we have LD(t) C BPLD(t,p, q), for any t = o{n). 

Recall that [34] investigates the question of whether randomization helps for constructing 
in constant time a solution for a problem in LCLC LD{0{1)). We stress that the technique 
used in [M] for tackling this question relies heavily on the definition of LCL, specifically, 
that only graphs of constant degree and of constant input size are considered. Hence it is 
not clear whether the technique of [34] can be useful for our purposes, as we impose no such 
assumptions on the degrees or input sizes. Also, although it seems at first glance, that Lovsz 
local lemma might have been helpful here, we could not effectively apply it in our proof. 
Instead, we use a completely different approach. 

Theorem 3.1 Let C be an hereditary language and let t be a function. If C & BPLD(t,p, q) 
for constants p,q E (0, 1] such that + q> 1, then C G LD(0(t)). 

Proof. Let us start with some definitions. Let £ be a language in BPLD(t,p, g) where 
p,q E (0,1] and p"^ + q > 1, and t is some function. Let A be a randomized algorithm 
deciding C, with "yes" success probability p, and "no" success probability q, whose running 
time is at most t(G,x, Id), for every configuration (G, x) with identity assignment Id. Fix a 
configuration (G, x), and an id-assignment Id for the nodes of V{G). The distance distG'(M, v) 
between two nodes of G is the minimum number of edges in a path connecting u and v in 
G. The distance between two subsets Ui,U2 C V is defined as 

distG'(^i, t^2) = min{distG(M, | u G Ui,v G U2}- 

For a set U C V, let S{G, x. Id, U) denote the event that when running A on (G, x) with 
id-assignment Id, all nodes in U output "yes". Let v G V{G). The running time of A at w 
may depend on the coin tosses made by the nodes. Let t^ = tv{G, x. Id) denote the maximal 
running time of v over all possible coin tosses. Note that ty < t{G, x. Id) (we do not assume 
that neither t or ty are known to v). 

The radius of a node v, denoted r^, is the maximum value tu such that there exists a 
node u, where v G Bciujiu). (Observe that the radius of a node is at most t.) The radius 
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of a set of nodes S is := maxjrj, | v G S}. In what follows, fix a constant 6 such that 
0<6<p^ + q — 1, and define A = 11 [logp/log(l — 6)] . 

A splitter of (GjX, Id) is a triplet {S, 1/1,1/2) of pairwise disjoint subsets of nodes such 
that S U UiU U2 = V, distG(^i, U2) > Ar^. (Observe that rs may depend on the identity 
assignment and the input, and therefore, being a splitter is not just a topological property 
depending only on G). Given a splitter (5, Ui, U2) of (G, x. Id), let Gk = G[Uk U S], and let 
Xfc be the input x restricted to nodes in Gk-, for k = 1,2. 

The following structural claim does not use the fact that C is hereditary. 

Lemma 3.2 For every configuration (G, x) with identity assignment Id, and every splitter 
{8,1/1,1/2) o/(G,x, Id), we have 

(^(Gi,xi) e C and (Gz.xz) G (G,x) e £. 

Let (G, x) be a configuration with identity assignment Id. Assume, towards contradic- 
tion, that there exists a splitter {8,1/1,1/2) of triplet (G,x, Id), such that (Gi,xi) G C and 
(G2,X2) G C, yet (G, x) ^ £. (The fact that (Gi,xi) G £ and (G2,X2) G £ implies that both 
Gi and G2 are connected, however, we note, that for the claim to be true, it is not required 
that G[f/i], G[U2] or G[8] are connected.) Let d = Ar^. 

Given a vertex m G S", we define the level of u by i{u) = distG{Ui, {u}). For an integer 
i G [l,d], let Li denote the set of nodes in 8 of level i. For an integer i G {rs, d — rs), let 
Si = Uj=i-rs -^i' finally, for a set J C {rs, d - rs) of integers, let 8j = [j-^j 8i. 

Define 

I = {te {2rs, d - Irs) \ Pr[£(G, x. Id, S,)] < 1 - 5}. 
Claim 3.3 There exists i G {2rs, d — 2rs) such that i ^ I . 

Proof. For proving Claim [373| we upper bound the size of I hj d — 4rs — 2. This is done 
by covering the integers in {2rs, d — 2rs) by at most 4rs + 1 sets, such that each one is 
{4rs + l)-independent, that is, for every two integers in the same set, they are at least 
4rs + 1 apart. Specifically, for s G [1, irs + 1] and m{8) = \{d — 8rs)/{4:rs + 1)] , we define 
Js = {s + 2rs + j{4:rs + 1) | j G [0,m(5')]}. Observe that, as desired, {2rs,d — 2rs) C 
Use[i irs+i] •^•5' each s G [1, 4rs' + 1], Js is (4rs' + l)-independent. In what follows, fix 

s G [l,4r5' + 1] and let J = Js- Since (Gi,xi) G £, we know that, 

Pr[£(Gi,Xi,Id,^jn/)] > p. 
Observe that for i G {2rs, d — 2rs), ty < r^ < rs, and hence, the t„-neighborhood in G of 
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every node v E Si is contained in 5 C d, i.e., Bciv.t^) C d. It therefore follows that: 

Pr[^(G, X, Id, Sjr^i)] = Pr[£(Gi, Xi, Id, Sjr^i)] >p. (1) 

Consider two integers a and b in J. We know that |a — 6| > 4rs + l- Hence, the distance in G 
between any two nodes u E Sa and v G Sb is at least 2rs + 1. Thus, the events S{G, x. Id, Sa) 
and S{G, x. Id, Sb) are independent. It follows by the definition of J, that 

Pr[£:(G,x,Id,^,n/)]<(l-5)l^"'l (2) 

By ([1]) and ({2)), we have that p < {1 - and thus | J fl /| < logp/log(l - 6). Since 

{2rs, d — 2rs) can be covered by the sets Jg, s = 1, . . . , Ars + 1, each of which is (4r5 + 1)- 
independent, we get that 

4rs+l 

|/| = Yl \Jsr]I\ < (4r5 + l)(logp/log(l-r)) . 

s=l 

Combining this bound with the fact that d = Xrs, we get that d — 4rs — l > |/|. It follows by 
the pigeonhole principle that there exists some i G {2rs, d — 2rs) such that z ^ /, as desired. 
This completes the proof of Claim 13.31 □ 

Fix i G {2rs, d — 2rs) such that z ^ /, and let = S{G, x. Id, 5*^). By definition, 

Ft[T] <5<p^ + q-l. (3) 

Let Hi denote the subgraph of G induced by the nodes in (Uj=^/~^ -^i) ^ ^i- similarly 
define H2 as the subgraph of G induced by the nodes in (IJj>j+rg -^i) ^2- Note that 
Si U V{Hi) U V{H2) = V, and for any two nodes u G V{Hi) and v G V{H2), we have 
dciu, v) > 2rs- It follows that, for /c = 1, 2, the t„-neighborhood in G of each node u G V{Hk) 
equals the t„-neighborhood in Gk of m, that is, Bc{u,tu) C Gk- (To see why, consider, for 
example, the case k = 2. Given u G V{H2), it is sufficient to show that $v G V{Hi), such 
that V G Bg{u, tu)- Indeed, if such a vertex v exists then dc^u, v) > 2rs, and hence tu > 2rs- 
Since there must exists a vertex w E Si such that w G B{u,tu), we get that > 2rs, 
in contradiction to the fact that w G S.) Thus, for k = 1,2, since (G'.j,Xi) G £, we get 
Pr[SiG,^,ld,V{H,))] = FT[S{G„^i,ld,V{H,))] > p. 

Let J^' = S{G,^,ld,V{Hi)UV{H2)). As the events £(G, x. Id, ^(iJi)) and x. Id, ^(iJs)) 
are independent, it follows that Pr[J^'] > p^, that is 

Ft[P] <l-p^ (4) 
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By Eqs. (|3]) and (jl]), and using union bound, it follows that PrfJ-" V J^'] < q. Thus 

Pr[^(G,x,Id,\/(G'))] = Pr[£'(G',x,Id,5, Ul/(iJi)Ul/(iJ2))] = Pr[J^ A J^'] > 1 - g . 

This is in contradiction to the assumption that (G, x) ^ C. This concludes the proof of 
Lemma 13.21 □ 

Our goal now is to show that C G LD(0(t)) by proving the existence of a deterministic 
local algorithm D that runs in time 0{t) and recognizes C (No attempt is made here to 
minimize the constant factor hidden in the 0{t) notation.) Recall that both t = t(G',x, Id) 
and tjj = ty{G, x, Id) may not be known to v. Nevertheless, by inspecting the balls Bg{v, 2*) 
for increasing i = 1, 2, ■ ■ ■ , each node v can compute an upper bound on as given by the 
following claim. 

Claim 3.4 Fix a a configuration {G,x), an id- assignment Id, and a constant c. In 0{t) 
time, each node v can compute a value t* = t*(c) such that (1) c ■ t^ < tl = 0(t) and (2) for 
every u G Bg{v, c- t*), we have tu <t*. 

To establish the claim, observe first that in 0{t) time, each node v can compute a value 
t'^ satisfying t^ < t'^ < 2t. Indeed, given the ball Bg{v,2'^), for some integer i, and using 
the upper bound on number of (sequential) local computations, node v can simulate all its 
possible executions up to round r = 2\ The desired value t'^ is the smallest r = 2* for which 
all executions of v up to round r conclude with an output at v. Once t'^ is computed, node 
V aims at computing t*. For this purpose, it starts again to inspect the balls Bg{v,2'^) for 
increasing i = 1, 2, ■ ■ ■ , to obtain t'^ from each u G Bg{v, 2*). (For this purpose, it may need 
to wait until u computes t'^, but this delays the whole computation by at most 0{t) time.) 
Now, node v outputs = 2* for the smallest i satisfying (1) c ■ t'^ < 2* and (2) for every 
u G Bg{v, c ■ 2*), we have t'^ < t*. It is easy to see that for this i, we have 2* = 0{t), hence 

t: = o{t). 

Given a configuration (G,x), and an id-assignment Id, Algorithm D, applied at a node 
u first calculates t* = t*(6A), and then outputs "yes" if and only if the 2At* -neighborhood 
of u in (G, x) belongs to C That is, 

out(n) = "yes" ^ (5c(n, 2At:), x[SG(n, 2At:)]) G C. 

Obviously, Algorithm D is a deterministic algorithm that runs in time 0{t). We claim 
that Algorithm D decides C. Indeed, since C is hereditary, if [G, x) G C, then every prefix 
of (G,x) is also in £, and thus, every node u outputs out(M) ="yes". Now consider the 
case where (G, x) ^ C, and assume by contradiction that by applying D on (G, x) with 
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id-assignment Id, every node u outputs out(n) ="yes". Let U C V{G) be maximal by 
inclusion, such that G[U] is connected and x[t/]) G C. Obviously, U is not empty, as 

(5G(M,2At*),x[_BG'(M,2At*)]) G £ for every node M. On the other hand, we have |[/| < 
because (G, x) ^ C 

Let M G t/ be a node with maximal t„ such that BG{u,2tu) contains a node outside 
U. Define G' as the subgraph of G induced by f/ U 2tu)). Observe that G' is 

connected and that G' strictly contains U. Towards contradiction, our goal is to show that 
(G',x[G']) G C. 

Let H denote the graph which is maximal by inclusion such that H is connected and 

Bg{u,2Q C H C BG{u,2Qu{UnBG{u,2Xt*J) . 

Let W'^, W"^, ■ ■ ■ , be the £ connected components of G[U]\Bg{u, 2tu), ordered arbitrarily. 
Let W'^ be the empty graph, and for k = 0, 1,2, ■■■ define the graph = H U 

U U U ■ ■ ■ U W^. Observe that is connected for each k = 0, 1, 2, ■ ■ ■ , 
We prove by induction on k that (Z^,x[Z'^]) G C for every k = 0,1,2,--- This will 
establish the contradiction since = G'. For the basis of the induction, the case k = 0, 
we need to show that {H,x.[H]) G C However, this is immediate by the facts that if is a 
connected subgraph of Bg{u, 2At*), the configuration {Bg{u, 2At*), x[i?G(^i, 2At*)]) G C, and 
£ is hereditary. Assume now that we have {Z^,:k[Z^]) E C for < k < i, and consider the 
graph Z^'^^ = Z^ U W^~^^. Define the sets of nodes 

S = V{Z'')nV{W''+^), Ui = V{Z'')\S, and U2 = V{W''+^)\S. 

A crucial observation is that {S, Ui, U2) is a splitter of Z''^^. This follows from the following 
arguments. Let us first show that rs < t*. By definition, we have < t*, for every 
V G BG{u,6\tD. Hence, in order to bound the radius of 5* (in Z^~^^) by t* it is sufficient 
to prove that there is no node w E U \ BG{u,6XtD such that BGiw.t^) fl 5 7^ 0. Indeed, 
if such a node w exists then > 4At* and hence Bg{w, 2t^) contains a node outside U, in 
contradiction to the choice of u. It follows that rs < t^. 

We now claim that dist2fc+i(f/i, f/2) > At*. Consider a simple directed path P in Z'''^^ 
going from a node a; G f/i to a node y G 1/2- Since x ^ V(W^~^^) and y G ViW^"^^), we get 
that P must pass through a vertex in BG{u,2tu)- Let z be the last vertex in P such that 
z G BG{u,2tu), and consider the directed subpath P[z,y] of P going from z to y. Now, let 
P' = P[z,y] \{^}- The first d' = min{(2A — 2)t* , |P'|} vertices in the directed subpath P' must 
belong to V{H) C 'V(Z^\ In addition, observe that all nodes in P' must be in 'V{yV^^^\ It 
follows that the first d nodes of P' are in S. Since y ^ S, we get that \P'\ > d' = (2A — 2)t*, 
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and thus \P\ > At*. Consequently, dist2fc+i(?7i, f/2) > At*, as desired. This completes the 
proof that {S, Ui, U2) is a splitter of Z^'^^. 

Now, by the induction hypothesis, we have {Gi, ^[Gi\) G £, because Gi = G[UiUS] = . 
In addition, we have (G2, x[G2]) e £, because G2 = G[U2 U S*] = and W'''^^ is a prefix 

of We can now apply Lemma 13.21 and conclude that (Z^'"^^, x[Z'^'"^^]) G C This 

concludes the induction proof. The theorem follows. □ 

Let Planar = {(G,e) : G is planar}. Interval = {(G, e) : G is an interval graph} and 
CycleFree = {(G, e) : G has no cycle}. One can easily check that neither of these three 
languages is in LD(t), for any t = o{n). Hence, Theorem 13.11 yields the following. 

Corollary 3.5 Letp,q G (0, 1] such thatp^+q > 1, t/ien Unique-Leader, Planar, Interval 
and CycleFree are not in BPLD(t,p, q), for any t = o{n). 

4 Nondeterminism and complete problems 
4.1 Separation results 

Our first separation result indicates that non-determinism helps for local decision. Indeed, 
we show that there exists a language, specifically, tree = {(G, e) | G is a tree}, which 
belongs to NLD(l) but not to LD(t), for any t = o{n). The proof follows by rather standard 
arguments. 

Theorem 4.1 LD(t) C NLD(t), for any t = o{n). 

Proof. To establish the theorem it is sufficient to show that there exists a language 
C such that C ^ LD(o(n)) and C G NLD(l). Let tree = {(G,e) | G is a tree}. We 
have tree ^ LD{o{n)). To see why, consider a cycle G with nodes labeled consecutively 
from 1 to 4n, and the path Pi (resp., P2) with nodes labeled consecutively 1, . . . ,4n (resp., 
2n + 1, . . . , 4n, 1, . . . , 2n), from one extremity to the other. For any algorithm A deciding 
tree, all nodes n+ 1, . . . , 3n output "yes" in configuration (Pi, e) for any identity assignment 
for the nodes in Pi, while all nodes 3r;, + 1, . . . , 4n, 1, . . . , n output "yes" in configuration 
(P2,e) for any identity assignment or the nodes in P2. Thus if A is local, then all nodes 
output "yes" in configuration (G, e), a contradiction. In contrast, we next show that tree G 
NLD. The (nondeterministic) local algorithm A verifying tree operates as follows. Given 
a configuration (G, e), the certificate given at node v is yiy) = distG{v,r) where r G V{G) 
is an arbitrary fixed node. The verification procedure is then as follows. At each node v, A 
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inspects every neighbor (with its certificates), and verifies the following: 

• y(f ) is a non-negative integer, 

• if y{v) = 0, then y{w) = 1 for every neighbor w of v, and 

• if y(f ) > 0, then there exists a neighbor w of v such that y{w) = y{v) — 1, and, for all 
other neighbors w' of v, we have y{w') = y(f ) + 1. 

If G is a tree, then applying Algorithm A on G with the certificate yields the answer "yes" 
at all nodes regardless of the given id-assignment. On the other hand, if G is not a tree, 
then we claim that for every certificate, and every id-assignment Id, Algorithm A outputs 
"no" at some node. Indeed, consider some certificate y given to the nodes of G, and let C 
be a simple cycle in G. Assume, for the sake of contradiction, that all nodes in C output 
"yes". In this case, each node in C has at least one neighbor in G with a larger certificate. 
This creates an infinite sequence of strictly increasing certificates, in contradiction with the 
finiteness of C. □ 

Theorem 4.2 There exists a language C such that C ^ NLD(t), for any t = o{n). 

Proof. Let InpEqSize = {(G',x) | Vt; G V{G), x{v) = \V{G)\}. We show that 
InpEqSize ^ NLD(t), for any t = o{n). Assume, for the sake of contradiction, that there 
exists a local nondeterministic algorithm A deciding InpEqSize. Let t < n/4 be the running 
time of A. Consider the cycle G with 2t + 1 nodes ui,U2, ■ ■ ■ ,U2t+i, enumerated clockwise. 
Assume that the input at each node Ui of G satisfies x(Mj) = 2t + 1. Then, there exists a 
certificate y such that, for any identity assignment Id, algorithm A outputs "yes" at each 
node of G. Now, consider the configuration (C",x') where the cycle G' has 4t + 2 nodes, and 
for each node Vi of G', x'(fj) = 2t + 1. We have (C",x') ^ InpEqSize. To fool Algorithm 
A, we enumerate the nodes in G' clockwise, i.e., G = (wi,f2, ■ ■ ■ ,^44+2). We then define the 
certificate y' as follows: 

y'(^i) = y'{vi+2t+i) = yiui) for z = 1,2, ■ • -2^ + 1 . 

Fix an id-assignment Id' for the nodes in V{G'), and fix z G {l,2,---2t + 1}. There 
exists an id-assignment Idi for the nodes in V{G), such that the output of A at node Vi in 
(C",x') with certificate y' and id-assignment Id' is identical to the output of A at node Ui in 
(C, x) with certificate y and id-assignment Idi. Similarly, there exists an id-assignment Id2 
for the nodes in V{G) such that the output of A at node Wj+2t+i in (C",x') with certificate 
y' and id-assignment Id' is identical to the output of A at node Ui in (C, x) with with 
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certificate y and id- assignment Id2. Thus, Algorithm A at both Vi and f j+2t+i outputs "yes" 
in (C",x') with certificate y' and id-assignment Id'. Hence, since i was arbitrary, all nodes 
output "yes" for this configuration, certificate and id-assignment, contradicting the fact that 
(C",x') ^ InpEqSize. □ 

For p, g G (0, 1] and a function t, let us define BPNLD(t, p, q) as the class of all distributed 
languages that have a local randomized non-deterministic distributed (p, g)-decider running 
in time t. 

Theorem 4.3 Let p,q & (0, 1] such that p"^ + q < 1. For every language C, we have C G 
BPNLD(l,p,g). 

Proof. Let £ be a language. The certificate of a configuration [G, x) G £ is a map of G, 
with nodes labeled with distinct integers in {1, ...,n}, where n = \V{G)\, together with the 
inputs of all nodes in G. In addition, every node v receives the label X{v) of the corresponding 
vertex in the map. Precisely, the certificate at node v is y{v) = (G",x',z) where G' is an 
isomorphic copy of G with nodes labeled from 1 to n, x' is an n- dimensional vector such 
that x'[A(m)] = x(m) for every node u, and i = X{v). The verification algorithm involves 
checking that the configuration (G',x') is identical to (G, x). This is sufficient because 
distributed languages are sequentially decidable, hence every node can individually decide 
whether (G',x') belongs to C or not, once it has secured the fact that ((j",x') is the actual 
configuration. It remains to show that there exists a local randomized non-deterministic 
distributed (p, g)-decider for verifying that the configuration (G",x') is identical to (G',x), 
and running in time 1. 

The non-deterministic [p, g)-decider operates as follows. First, every node v checks that 
it has received the input as specified by x', i.e., v checks wether x'[A(t')] = x(f ), and outputs 
"no" if this does not hold. Second, each node v communicates with its neighbors to check 
that (1) they all got the same map G' and the same input vector x', and (2) they are labeled 
the way they should be according to the map G'. If some inconsistency is detected by a node, 
then this node outputs "no" . Finally, consider a node v that passed the aforementioned two 
phases without outputting "no" . If X{v) ^ 1 then v outputs "yes" (with probability 1), and 
if X{v) = 1 then v outputs "yes" with probability p. 

We claim that the above implements a non-deterministic distributed [p, g)-decider for 
verifying that the configuration (G',x') is identical to {G,x.). Indeed, if all nodes pass the 
two phases without outputting "no", then they all agree on the map G' and on the input 
vector x', and they know that their respective neighborhood fits with what is indicated on 
the map. Hence, (G',x') is a lift of (G,x). If follows that (G',x') = (G',x) if and only 
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if there exists at most one node v E G, whose label satisfies \{v) = 1. Consequently, if 
(G', x') = (G, x) then all nodes say "yes" with probability at least p. On the other hand, if 
(G', x') 7^ (G, x) then there are at least two nodes in G whose label is "1" . These two nodes 
say "yes" with probability p^, hence, the probability that at least one of them says "no" is 
at least 1 — > q. This completes the proof of Theorem 14.31 □ 

The above theorem guarantees that the following definition is well defined. Let BPNLD = 
BPNLD(1, jo, g), for some p,q E (0,1] such that p"^ + q < 1. The following follows from 
Theorems OlOl andOl 

Corollary 4.4 LD(o(n)) C NLD(0(1)) C NLD(o(n)) C BPNLD = All. 
4.2 Completeness results 

Let us first define a notion of reduction that fits the class LD. For two languages £i, C2, we 
say that Ci is locally reducible to C2, denoted by Ci ^ C2, if there exists a constant time 
local algorithm A such that, for every configuration (G,x) and every id-assignment Id, A 
produces out(t;) G {0, 1}* as output at every node v G V{G) so that 

(G, x) G £1 ^ (G, out) G £2 • 

By definition, LD(0(t)) is closed under local reductions, that is, for every two languages 
£i,£2 satisfying £1 ^ £2, if £2 G LD(0(t)) then £1 G LD(0(t)). 

We now show that there exists a natural problem, called cover, which is in some sense 
the "most difficult" decision problem; that is, we show that cover is BPNLD-complete. 
Language cover is defined as follows. Every node v is given as input an element S{v), and a 
finite collection of sets S{v). The union of these inputs is in the language if there exists a node 
V such that one set in S{v) equals the union of all the elements given to the nodes. Formally, 
we define cover = {(G, {S,S)) \ 3v G V{G), 3S G S{v) s.t. S = {S{v) \ v G V{G)}. 

Theorem 4.5 cover is BPNLD-complete. 

Proof. The fact that cover G BPNLD follows from Theorem 14.31 To prove that cover is 
BPNLD-hard, we consider some £ G BPNLD and show that £ ^ cover. For this purpose, we 
describe a local distributed algorithm A transforming any configuration for £ to a configura- 
tion for cover preserving the memberships to these languages. Let (G, x) be a configuration 
for £ and let Id be an identity assignment. Algorithm A operating at a node v outputs a 
pair {S{v),S{v)), where S{v) is the "local view" at v in (G, x), i.e., the star subgraph of G 
consisting of v and its neighbors, together with the inputs of these nodes and their identities. 
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and S{v) is the collection of sets S defined as follows. For a binary string x, let \x\ denote 
the length of x, i.e., the number of bits in x. For every vertex v, let ip{v) 
Node V first generates all configurations (G",x') where G' is a graph with k < ip{v) vertices, 
and x' is a collection of k input strings of length at most ipiv), such that {G',x.') G C. For 
each such configuration (G",x'), node v generates all possible Id' assignments to V{G') such 
that for every node u G V{G'), |Id(u)| < 4j{v). Now, for each such pair of a graph (G',x') 
and an Id' assignment, algorithm A associates a set S* G S{v) consisting of the k = \V{G')\ 
local views of the nodes of G' in (G',x'). We show that {G,x.) G C <^=^ A{G,x.) G cover. 

If (G, x) G C, then by the construction of Algorithm A, there exists a set G S{v) such 
that S covers the collection of local views for (G, x), i.e., S = {S{u) \ u G G}. Indeed, 
the node v maximizing il){v) satisfies '4){v) > max{Id(n) | u G V{G)} > n and '4){v) > 
max{x(M) I u G V{G)}. Therefore, that specific node has constructed a set S which contains 
all local views of the given configuration (G, x) and Id assignemnt. Thus A{G,x.) G cover. 

Now consider the case that A{G,x.) G cover. In this case, there exists a node v and a 
set S E S{v) such that S = {S{u) \ u E G}. Such a set 5* is the collection of local views 
of nodes of some configuration (G',x') G C and some Id' assignment. On the other hand, 
S is also the collection of local views of nodes of the given configuration [G, x) G £ and Id 
assignment. It follows that (G,x) = (G',x') G £. □ 

We now define a natural problem, called containment, which is NLD(0(l))-complete. 
Somewhat surprisingly, the definition of containment is quite similar to the definition of 
cover. Specifically, as in cover, every node v is given as input an element S{v), and a finite 
collection of sets S{v). However, in contrast to cover, the union of these inputs is in the 
containment language if there exists a node v such that one set in S{v) contains the union 
of all the elements given to the nodes. Formally, we define containment = {(G, {S,S)) \ 
3v G V{G), 3S G S{v) s.t. S 3 {S{v) \ v G V{G)}. 

Theorem 4.6 containment is NLD(0(l))-ccimj>/ete. 

Proof. We first prove that containment is NLD(0(l))-hard. Consider some C G NLD(0(1)); 
we show that C ^ containment. For this purpose, we describe a local distributed algorithm 
D transforming any configuration for £ to a configuration for containment preserving the 
memberships to these languages. 

Let t = tc > he some (constant) integer such that there exists a local nondeterministic 
algorithm Ac deciding C in time at most t. Let (G, x) be a configuration for C and let Id be 
an identity assignment. Algorithm D operating at a node v outputs a pair {S{v),S{v)), where 
S{v) is the "t-local view" at v in (G,x), i.e., the ball of radius t around v, BG{v,t), together 
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with the inputs of these nodes and their identities, and S{v) is the collection of sets S defined 
as follows. For a binary string x, let |x| denote the length of x, i.e., the number of bits in x. 
For every vertex v, let ip{v) Node V first generates all configurations (G",x') 

where G' is a graph with m < ip{v) vertices, and x' is a collection of m input strings of length 
at most ip{'^)y such that (G",x') G £. For each such configuration (G',x'), node v generates 
all possible Id' assignments to V{G') such that for every node u G V{G'), |Id(n)| < tp{v). 
Now, for each such pair of a graph (G',x') and an Id' assignment, algorithm D associates a 
set S G S{v) consisting of the m = \V{G')\ t-local views of the nodes of G' in (G",x'). We 
show that {G,x.) G C <^=^ D{G,x.) G containment. 

If (G,x) G C, then by the construction of Algorithm D, there exists a set S" G S{v) such 
that S covers the collection of t-local views for (G,x), i.e., 5* = {S{u) \ u G G}. Indeed, 
the node v maximizing il){v) satisfies ip{v) > max{Id(M) | u G V{G)} > n and il){v) > 
max{x(M) I u G V{G)}. Therefore, that specific node has constructed a set S that precisely 
corresponds to {G, x) and its given Id assignment; hence, S contains all corresponding t-local 
views. Thus, D{G,x) G containment. 

Now consider the case that D{G,x.) G containment. In this case, there exists a node 
V and a set S* G S{v) such that S ^ {S{u) \ u E G}. Such a set S is the collection of 
t-local views of nodes of some configuration (G',x') G C and some Id' assignment. Since 
(G',x') G C, there exists a certificate y' for the nodes of G', such that when algorithm Ac 
operates on (G",x',y'), all nodes say "yes". Now, since S contains the t-local views of nodes 
(G, x), with the corresponding identities, there exists a mapping : (G, x. Id) — j- (G', x'. Id') 
that preserves inputs and identities. Moreover, when restricted to a ball of radius t around 
a vertex v E G, (p is actually an isomorphism between this ball and its image. We assign a 
certificate y to the nodes of G: for each v G V{G), y{v) = y'{(j){v)). Now, Algorithm Ac 
when operating on (G,x, y) outputs "yes" at each node of G. By the correctness of Ac, we 
obtain (G, x) G C 

We now show that containment G NLD(0(1)). For this purpose, we design a nondeter- 
ministic local algorithm A that decides whether a configuration (G, x) is in containment. 
Such an algorithm A is designed to operate on (G,x, y), where y is a certificate. The con- 
figuration (G,x) satisfies that x(t') = {S{v),S{v)). Algorithm A aims at verifying whether 
there exists a node v* with a set S* G S{v*) such that S* D {£{v) \ v G V{G)}. 

Given a correct instance, i.e., a configuration (G,x), we define the certificate y as 
follows. For each node v, the certificate y{v) at v consists of several fields, specifically, 
y{v) = {yc{v),ys{v),yi^{v),yi{v)). The candidate configuration field y^v) is a triplet 
ydv) = (G',x',Id'), where (G',x') is an isomorphic copy (G',x') of (G,x) and Id' is an 
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identity assignment for the nodes of G' . The candidate set field ys{v) is a copy of S**, i.e., 
y^(f) = S* . In addition, let u and u* be the nodes in (G",x') corresponding to v and v*, 
respectively. The candidate identity field Yidiy) is yjrf(f ) = Id'(M), and the candidate leader 
field y;(t;) is y;(t') = Id'(M*). 

We describe the operation of Algorithm A on some triplet (G,x, y). First, each node v 
verifies that it agrees with its neighbors on the candidate configuration and candidate set 
fields in their certificates. That way, if all nodes say "yes" then we know that all nodes hold 
the same candidate configuration which is some triplet (G',x',Id'), and the same candidate 
set S' . Second, each node verifies that £{v) G S' . Also, each node checks that it agrees with 
its neighbors on the candidate leader field in their certificates. I.e, that there exists some 
integer k such that for all nodes v we have y;(^^) = k. Each node v checks that there exists 
a node u* G V{G') such that Id'(M*) = /c, and that there exists a node v' G V{G') such 
that yidiv) = Id'(f'). Moreover, node v verifies that the input x' at v' contains a collection 
of sets S'{v') that contains S' , that is, 5" G S'{v'). Finally, each node v verifies that its 
immediate neighborhood Bg{v, 1) agrees with the corresponding neighborhood of v' in G', 
and that the candidate identities Jidiw) of its neighbors w G Bg{v, 1) are compatible with 
the corresponding identities ld'{w') in G'. We term this verification the neighborhood check 
of V. 

It is easy to see that when applying Algorithm A on a correct instance, together with 
the certificate described above, each node outputs "yes" . We now show the other direction. 
Assume that Algorithm A applied on some triplet (G, x, y) outputs "yes" at each node, 
our goal is to show that (G,x) G C Since all nodes say "yes" on (G, x, y), it follows that 
the certificate y{v) at every node v G V{G) contains the same candidate configuration field 
(G',x',Id'), the same candidate set S' and the same pointer ld'{v') to a vertex v' G G', 
such that S' G <S{v'). Since each node v G V{G) verifies that S{v) G 5", it follows that 
S' ^ {^^(^0 I t-' £ It remains to show that there exists a node v* G V{G) such that 

S' G S{v*). Indeed, this follows by the neighborhood checks of all nodes. □ 

5 Future work 

This paper aims to make a first step in the direction of establishing a complexity theory for 
the locality discipline. Many interesting questions are left open. For example, it would be 
interesting to investigate the connections between BPLD(t,p, g) for different p and q such 
that p^ + q <1. (A simple observation shows that BPLD(t,p, q) C BPLD(t,p^, 1 — (1 — q)''), 
for every integer k. Indeed, given an algorithm with a "yes" and "no" success probabilities 
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p and g, one can modify the success probabilities by performing k runs and requiring each 
node to individually output "no" if it decided "no" on at least one of the runs. In this 
case, the "no" success probability increases from q to at least 1 — (1 — q)^, and the "yes" 
success probability then decreases from p to p''.) Another interesting question is whether 
the phenomena we observed regarding randomization occurs also in the non-deterministic 
setting, that is, whether BPNLD(t,p, g) collapses into NLD(0(t)), for p"^ + q > 1. 

Our model of computation, namely, the COCAC model, focuses on difficulties arising 
from purely locality issues, and abstracts away other complexity measures. Naturally, it 
would be very interesting to come up with a rigorous complexity framework taking into 
account also other complexity measures. For example, it would be interesting to investigate 
the connections between classical computational complexity theory and the local complexity 
one. The bound on the (centralized) running time in each round (given by the function 
/, see Section [2]) may serve a bridge for connecting the two theories, by putting constrains 
on this bound (i.e., / must be polynomial, exponential, etc). Also, one could restrict the 
memory used by a node, in addition to, or instead of, bounding the sequential time. Finally, 
it would be interesting to come up with a complexity framework taking also congestion into 
account. 
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